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±..*W Data Transmission 

This invention is concerned with transmission of data from a sending station to a 
receiving terminal, over a telecommunications network. More particularly, it envisages a 
situation where the transmitted bit-rate that the network can support fluctuates, typically due to 
5 the use of a congestion control mechanism. For example, the TCP/IP. system uses IP (Internet" 
Protocol) for transport. This is a connectionless service and simply transports transmitted 
packets to a destination. TCP (Transmission Control Protocol) is an overlay to this service and 
brings in the idea of a connection; the sending station transmits a packet and waits for an 
acknowledgement from the receiving terminal before transmitting another (or in the event of no 

10 acknowledgement within a timeout period, it retransmits the packet). More importantly (in the 
present context) it embodies a congestion control algorithm where it begins with a small packet 
size and progressively increases the packet size until packet loss occurs, whereupon it reduces 
the size again. After this initial "slow start" phase, the algorithm continues to increase the packet 
size (albeit more gradually) backing off whenever packet loss occurs; necessarily this involves 

15 some cyclic variation of the packet size. A description of TCP is to be found in "Computer 
Networks", by Andrew S. Tanenbaum, third edition, 1996, pp. 521 - 542. 

Another common protocol is UDP (User Data Protocol). This does not have a congestion 
control mechanism of its own, but it has been.proposed to add one to it, the so-called "TCP- 
Friendly Rate Protocol (TFRC) described in the Internet Engineering Task Force (IETF) 
20 document RFC3448. This aims to establish an average transmitting data rate similar to that 
which the TCP algorithm would have achieved, but with a smaller cyclic variation. It too 
exhibits the same "slow start" phase. .]/■ 

One drawback of this slow start system is that the transmitting station will not "know" 
what bit-rate the network will provide until the slow start phase is complete - which may take 

25 (depending on the round-trip time of the network) as much as several seconds. For some 

applications this does not matter, but in others it does: for example, when streaming video from a 
server which provides a choice of compression rates, the server cannot make an informed 
decision at the outset about which rate to choose. In the past, one method of dealing with this 
has been that the server starts by transmitting the lowest quality stream and switches up to a 

30 higher rate if and when it finds that the network can support it. 

It should be stressed that the invention does not require that either of the two protocols 
discussed above should be used; it does .however start from the assumption that one is to transmit 
over a connection the bit-rate of which does not become apparent until after transmission has 
begun. 

35 According to one aspect of the present invention there is provided a method of 

transmitting!!^^ 

the data comprise a first part and at least two alternative second parts corresponding to respective 
different resolutions, for presentation at a receiving terminal simultaneously With the first part, 
comprising: 

40 (a) transmitting at least an initial portion of the first part; 

(b) receiving data indicative of the available transmission capacity; 

(c) choosing among the alternative second parts, as a function of the data indicative of the 
available transmission capacity; 

(d) transmitting the chosen second part and any remainder of the first part. . 
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Other aspects of the invention are defined in the claims 

Some embodiments of the invention will now be described, by way of example, with 
reference to the accompanying drawings, in which: 

Figure 1 is ablock diagram of a transmission system in accordance with one embodiment 
of the invention; and 

Figure 2 is a flowchart illustrating the operation of the server shown in Figure 1 . 

file A stored in a store 10, along with several versions VI, V2, V3 of the video, en 
different compression rates. • 

to the rate of audio or video material. This refers to the refe rences at 
by the original encoder, which (apart from P^ bte f ^*^^^d by the ultimate 
the transmitting end) is also equal to the average rate ^^^^e varies according 
decoder. Even in constant bit-rate video ^P?**^ . 
to picture content but is smoothed to a constant rate by the use « statkm 
bit-rate" we mean the actual rate at which data are transmitted by the transmitting . 

For thepurposes of represent example, ^^^^^^^ 
encoded by means of some suitable compression algorithm at 4.8 Kbit/s, wjxlk, 

are encoded at 10. M -d «J£ s respec ^^^t^L. 
known encoding standards such as the ITU H.261, xi.-oo, ot one 

The server 1 has a TCP interface Unconnected by 
the internet. The TCP interface is entirely conventual and ^^^idTcUtrol 
an input 1 11 for data, an output 1 12 for sendrng data j£*^ftT££toSte deliver 
output 1 13 which indicates to toe remainder f**™£*Z£ ^Teo data tore the store 
tether data to the input 1 11. A ™* &E5£ 1 f delivered to the input is 

10, and to deliver it to the input 111 of the TCP interlace it. ine 
also monitored by a unit 14 whose function will he desonbed later. There is also hm 

information about the available tranarmthng rate that ^CP mtertac^l 
nerwo*, and in — ^trSSr^ of *is server, 

choice as to audit • 

complete (or at least, has progressed sufficiently to enable avweo r 

purpose of the rate monitoring unit 14 is to ^^^^^^^SS to) a timer 
essence, it counts the bits delivered to the Y^^^Z^^P^ This 
so that it can calculate the actual transmitting bit rate ^at this number ofbits p ons Qf 

[0 measurement could be made over one round-tnp time bv£ murder to smooto ^ 
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The system operation will now be described with reference to the flowchart shown in 
Figure 2. Operation begins at Step 400, where a parameter R^ representing the previous current 
transmitting rate is initialised to a high value, and the timer 15 is reset. At Step 401 the control 
unit tests the interface output 113 until it is clear to send data. Once this test is passed it reads 
(Step 402) audio data from the file A in the store 10 and transfers this to the interface 1 1 . The 
interface 1 1 transmits this in accordance with normal TCP. 

The control unit then interrogates the output of the monitoring unit 14 and performs some 
tests of the value of the current transmitting bit-rate R T and also of the timer 15 (although 
naturally it is not to be expected that these tests will be passed on the first iteration). Thus if . 
(Step 403) the transmitting rate exceeds the rate needed to transmit audio plus full rate video (i.e. 
44.8 kbit/s), further monitoring of the slow start phase becomes unnecessary and the process 
jumps to Step 408 (described below). If not, then at Step 404 R T is tested to determine whether it 
exceeds its previous value. If so it is assumed that the slow start phase is stilLin progress. Rp is 
set equal to R T in Step 405 and the process is repeated from Step 401. If however R T < R P then 
the slow start phase is deemed to have ended. Rp is set equal to R T in Step 406 and the process 
moves on to a second phase. In the case of high round-trip times on the network, it can take a 
long time for the slow-start mechanism to conclude, and therefore also a test at Step 407 checks 
the state of the timer 1 5 and if this exceeds a maximum permitted waiting time the process 
moves on to the second phase where the video rate decision is then made on the basis of the 
known available transmitting bit-rate, even though this might not be the maximum. 

This second phase begins with the control unit making, at Step 408, a decision as to 
which video rate to use. In this example, it chooses the highest rate that, with the audio, 
represents a total bit-rate requirement less than or equal to R T , viz.: 

ifR T > 44.8 choose V3 

if 44.8 > R T > 24.8 choose V2 

if 24.8 > R T > 14.8 choose V3 

if R T < 14.8 5 transmission is not possible; exit at Step 409. 

Once this decision has been made, the control unit then proceeds at Step 410 to read 
video data from the relevant file to the TCP interface 11. It should be stressed that the initial 
part of this video data is contemporary (in terms of the original recorded material ) with the audio 
already sent. Inherent in Step 410, but conventional and hence not shown. explicitly, are flow 
control (analogous to Step 401), flow control feedback from the receiving terminal (so that the 
amount of data received does not cause buffer overflow) and the possibility of switching to a 
higher or lower rate video file in the event that network conditions improve or deteriorate, 
respectively. 

One issue that should be considered, though, is the fact that, because, during the start-up 
phase, only audio has been sent, the audio buffer at the receiving terminal is ahead of the video 
buffer. This may be considered desirable (to a degree at least) in providing a greater resilience to 
short-term network problems for the audio than for the video, so that in the event of packet loss 
causing video buffer underflow and hence loss of video at the receiving terminal, the user may 
continue to listen to the accompanying sound. But, if desired, the video streaming Step 410 may 
temporarily, during an initial period of this second phase, slow down or even suspend 
transmission of audio data, until the contents of the audio and video buffers at the receiving 
terminal reach either equality (in terms of playing time) or some other specified relationship. 
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1 "naturally this has the benefit of increasing the amount of video data that can be sent during this^B| 
initial period. 

Possible modifications to the arrangements shown in Figure 1 include: 

(a) The use of aUDP interface, with TFRC congestion control, as discussed in the 

5 introduction, in place of the TCP interface 1 1. In this case, because TFRC explicitly calculates 
the actual transmitting rate, it may be possible to dispense with the monitoring unit 1 3 and 
instead read the transmitting rate R T directly from the UDP/TFRC interface. Recognition of the 
end of slow start may still be performed as shown in Step 404 of the flowchart by comparing R T 
and Rp; alternatively it may be possible to recognise it by observing when the packet loss reaches 

10 a specified level. 

(b) The above description assumed that one would choose the highest video rate that the 
network would support; however the option also exists of deliberately choosing a lower rate m 
order to reduce or even eliminate the delay that occurs at the receiving terminal while the 
receiver video buffer is being filled to an acceptable level. Such measures are discussed m our 

15 international patent application no. PCT/GB 01/05246 [Agent's Ref. A26079]. 

(c) The above description assumed that the video and audio data originated from stored 
files However this method may be applied to the transmission of a live feed, provided that the 
server includes additional buffering so that the video can be held at the server during the initial 
period of audio-only transmission. 

20 (d) Alternative audio rates can be accommodated provide a criterion can be found 

~ whereby a decision between them can be made without recourse to any information about current 
network conditions. An example of this might be of an internet service that ^ be accessed via 
different access routes having vastly different bandwidths, perhaps via a standard 
telephone line and a 56kbit/s modem on the one hand and an ADSL connection at 500 kbit/s on 
the other. If the system has two alternative audio rates, say 4.8 kbit/s and 16 kbit/s and one 
makes the reasonable assumption that the PSTN connection can never support the higher rate and 
the ADSL connection always can, then if the server is informed by the receiving terminal (either 
automatically or manually) as to which type of access line is in use it can make a decision ot 
which of the two audio rates to choose, based on this information alone. Once that decision has 
30 been made, the process can proceed in the manner already described. 

In principle, the streaming method we have described will work with a conventional 
receiver. However, the benefits of the proposed method will be gained only if the receiver has 
the characteristic that, before beginning to play the received material, it waits until both its audio 
buffer and its video buffer contain sufficient data. In general, established video standards do not 
35 specify this functionality, leaving it to the discretion of the receiver designer. Of the receivers 

cuiren tly^vailable r some-have-this^^ 

decode and play audio as soon as the audio buffer is adequately full, even when no video data 
have arrived. We recommend that either one chooses a receiver of the former type, or one 
modifies the receiver control function so as to monitor the buffer contents and to initiate playing 
40 only when both buffers contain sufficient data (in accordance with the usual catena) to support 
continuous playout. 

A second embodiment of the invention will now be described. This is simi lar to the first, 

except"that IFuseT layeFed^ideo coaihg. ThatTsTo sayTinstead-ot having social (threerin-the- 

gAU ^ J _ . i r„,ui~u ;o conf rmp hac a base 



25 



except tnai u uses laycrcu viucu ^uuig. — > ~ , , 

above example) different versions of the video source only one of which is sent, one has a base 
45 layer source, which can be decoded by itself to give a low-quality video output and an 
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enhancement layer which is useless by. itself but can be decoded together with the base layer to 
produce a higher quality video output; and one may have further enhancement layers each ot 
which is usable oniy'in combination with the base layer and the intervening layer(s). In this 
example we also suppose that multiple (non-layered) audio rates are available. We recall that in 
the slow-start phase one has to transmit data in advance of deciding between the various 
alternative sources, and that the rationale of choosing to transmit the audio first was that, since 
there was only one audio rate one knew that this would inevitably have to be transmitted, 
whatever the rate decision. In this second example with alternative audio rates this ceases to be 
the case, since neither audio stream qualifies as "always to be transmitted". However the video 
base layer does so qualify, and thus in this case one proceeds by commencing transmission of the 
video base layer in Step 402. Then in step 408 one selects the video and audio rates to be used 
and in Steo 410 commences transmission of the selected audio and the enhancement layer(s), it 
anv appropriate to the chosen video rate. In this' instance, when transmitting enhancement .ayer 
video it would be appropriate to cease transmitting base layer video until all the enhancement 
15 . layer video contemporary with the base layer video already sent has been transmitted. . 

Of course, if only a single audio rate were used, then both audio and base layer video 
could be sent during the slow-start phase. . v . 

A third embodiment, similar to thesecond, uses video encoded using frame rate 
scalability in accordance with the MPEG4 standard. An encoded MPEG sequence consists of I- 
frames (encoded using intra-frame coding only), P-frames (encoded using inter-frame : differential 
coding based on the content of a preceding I-frame) and B-frames (encoded using bi-directional 
inter-frame prediction based on neighbouring I and P-frames). A typical MPEG sequence might 
be IBBPBBPIBBP etc. In frame rate scaleable coding one transmits for the lowest b *" rate 
stream just the I-frames; for a higher bit-rate stream, the I and P-frames, and for a higher still bit- 
rate, all the frames. In this instance one proceeds by transmitting only I-frames at Step 4U2 
during the slow-start phase. 

A yet further example is the transmission of a page for display (a « Veb page") consisting 
of text and graphics. The idea here is slightly different from the preceding examples ui that we 
are not now concerned with the transmission of material that has to be presented to the user m 
real time. Nevertheless, it is considered advantageous to provide, as we provide here, tor 
alternative graphics resolutions. So the store 10 contains text, for example in the form of an htal 
file and separate image files corresponding to one or more images which the receiving germinal 
is to display, in conventional manner, as part of a composite display. For each image there are 
several, perhaps three, image files stored in the store 10, at three different resolutions The text, 
or the beginning of it, is transmitted in Step 402 during the slow-start phase. At Step 408 a ■ 
decision is made, based on the available transmitting rate R T , as to which resolution to choose, 
theideai ?£ ing-thaUme JC hoos.es jtheJbighestresolufixai.mat^llQWS theima ges to be tr ffljmittedjn_ 
"a reasonable period. The exit at 409 is not necessary in this case. Then at step 410 the remaining 
text (if any) is transmitted, followed by the file of the chosen resolution *he or ea ch image. It 
the chosen files are renamed with filenames corresponding to those embedded m the text 
independent of the resolution) then no modification at the receiving terminal is necessary ana 
can display the composite page using a standard web browser. 
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Claims 

1 . A method of transmitting data over a network having initially undetermined 
transmission capacity, in which the data comprise a first part and at least two alternative second 

5 parts corresponding to respective different resolutions, for presentation at a receiving terminal 
simultaneously with the first part, comprising: 

(a) transmitting at least an initial portion of the first part; 

(b) receiving data indicative of the available transmission capacity; 

(c) choosing among the alternative second parts, as a function of the data indicative of the 
10 available transmission capacity; 

(d) transmitting the* chosen second part and any remainder of the first part. 

2. A method* according 'tp claim 1 or 2 in which, in an initial time period of step (d), 
transmission of a leading part of the chosen second part of an extent corresponding to the extent 

15 of the first part already-transmitted is performed preferentially to, or to the exclusion of, further 
transmission of the first part. 

3. A method according to claim 1 or 2 in which the first part is data representing an 
audio signal and the alternative second parts are alternative sets of video data, encoded at 

20 respective different compression rates, for presentation simultaneously with the audio data. 




25 



4. A method according to claim 1 or 2 in which the first part is the base layer data of 
layered coded video and the second part(s) are one or more enhancement layers of the layered 
coded video. 



5. A method according to claim 1 or 2 in which the first part is text and the 
alternative second parts are alternative versions, having different resolutions, of a graphical 
image to be displayed alongside the text. 



30 6. A method of transmitting data over a network having initially undetermined 

transmission capacity, in which the data comprise a first part and at least one second part, for 
presentation at a receiving terminal simultaneously with the first part, comprising: 

(a) transmitting at least an initial portion of the first part; 

(b) receiving data indicative of the available transmission capacity; 

35 (c) choosing, as a function of the data indicative of the available transmission capacity, 

among the options of transmitting no second part, or one or more of the second parts; 

(d)_txarismitting the.chosen-second.part^^ifany-y-and the remainder-otthe-firstpart, 
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